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I. INTRODUCTION 

Since its introduction in 1948, the Shannon entropy jJI has 
played a central role in all branches of communication the- 
ory, where it allows a precise and operational definition of 
useful concepts Uke transmission rate and channel capacity, 
and has found many important applications in other branches 
of science as well |2, 3]. Shannon introduced it to quantify 
the amount of "uncertainty" or "choice" present in a prob- 
ability distribution for a set of messages or symbols. In 
this regard Jhis entropy possesses some mathematical prop- 
erties Iliy,bllal2lla.l2ll that correspond to intuitive properties 
expected from a measure of uncertainty. 

In a recent paper |10], however, Brukner and Zeilinger 
analyse the appUcation of the Shannon entropy to the statis- 
tical outcomes of two simple quantum-mechanical thought- 
experiments and of a non-quantum-mechanical, or 'classical', 
one, and conclude that two particular properties of the Shan- 
non entropy are violated in quantum experiments while hold- 
ing in classical ones. 

This conclusion poses serious limitations when we wish to 
use the Shannon entropy, because of its particular properties, 
in quantum-mechanical problems. It would be therefore rea- 
sonable to ascertain in which sense or to what extent (if any) 
the claimed violations occur 

Deep criticisms of the conclusions of Ref. jlOll have al- 
ready appeared in the Uterature, mainly Hall's II III and Timp- 
son's fvi,^]3\. These interesting criticisms are mainly of a 
theoretical (and partly philosophical) nature, and so do not 
explain whether or how the concrete and interesting thought- 
experiments given in Ref. fid] really represent a violation of 
the Shannon entropy's properties. 

The purpose of the present paper is more pragmatic: to re- 
analyse the experiments presented in Ref. 1 10] and to show by 
explicit calculation that they present in fact no violation of any 
property of the Shannon entropy. 

We shall see that the seeming "violations" appear only be- 
cause the authors of Ref. 1 10] tacitly shift between different 
contexts (different experimental arrangements), thus misap- 



plying Shannon's formulae. In order to explain what is meant 
here by 'context', and to make the last point clearer, let us 
consider the following example: ' 

In another Wonderland, Alice is acquainted with two 
Cheshire Cats: a Frumious one and a very Mome one. The 
Frumious Cheshire Cat always appears with its tail ij) first, 
and then with the rest {R) of its body (head included), and in 
these apparitions Alice always observes first only that T - 
2 cm, and then that R - 3 cm, hence 

"r + /? = 5cm". (1) 

The Mome Cheshire Cat, instead, always appear with its rest 
{R) first, and then comes the tail (T), so that, when meeting 
it, Alice can first observe that R - 3Q cm, and then that T - 
20 cm; hence 

" + r = 50 cm ". (2) 

Looking at Eqs. Q and Alice concludes: 

''T + Ri^R + T", (3) 

or: "the commutative property of addition is violated for 
Cheshire Cats". 

Is Alice's conclusion about the commutative property cor- 
rect? No. First of all, we see that the content of Eq. ([ij is 
really just "2 cm + 3 cm - 3 cm + 2 cm - 5 cm", and analo- 
gously for Eq. (13, hence the commutative property is in fact 
clearly satisfied in those equations. 

On the other hand, what Alice means by Eq. Q is just 
"2 cm + 3 cm 30 cm -i- 20 cm", so that this equation has 
little to do with the commutative property of addition. In fact 
Eq. Q, as it is written, makes simply no sense, because the 
various mathematical symbols therein are inconsistently used: 
they have different values on the right- than on the left-hand 
side. 

This comes about because Alice tacitly shifts from a given 
context, where the symbols T' and 'R' refer to the Frumious 
Cheshire Cat and have some values, to a different context. 
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A less informal example is presented in the published version of this paper, 
Phys. Rev. A 69, 062108 (2004). 
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where the same symbols refer to the Mome Cheshire Cat and 
have different values — but the commutative property of ad- 
dition is not meant to apply that way. Related to this is the 
fact that Alice seems to attribute a temporal meaning to the 
order in which the summands appear in the additions Q-Q, 
but we know that the order of terms in an addition has no tem- 
poral meaning. If Alice wishes to express temporal details, 
she can do so by, e.g., appending subscripts to the symbols, 
like Tf, R,i', etc.; in any case she cannot burden the addition 
operation with meanings that the latter does not have. 

Hence, Alice's incorrect conclusion stems from her using, 
inconsistently, the same symbols T and R for two different and 
incompatible logical and experimental contexts. She could 
have used, e.g., TMome/s ^Frum.fs etc., instead, writing her 
findings as 

^Mome,/' + ^Mome.f" — ^Mome.f" + TMome/' 

^Frum,/" "t" ^Fmm,/' ~ ^Frum/' "t" ^Frum.r" ? (4) 

where all symbols are consistently and informatively used and 
it is clear that no violations of any mathematical property oc- 
cur 

In nuce, an analogous /flMJc pas is made in Ref. fl^, al- 
though less apparent, because there instead of the addition we 
have more complex mathematical functions — probabilities 
and entropies — , and quantum experiments take the place of 
the example's Cheshire-Cats experiments. In the mentioned 
paper, the authors compare probabilities and Shannon en- 
tropies pertaining to different contexts (different experimental 
arrangements); thus their equations do not pertain to the prop- 
erties they meant to discuss. To my knowledge, this simple 
point has hitherto never been noticed in the literature, and will 
be shown here by explicit calculation. 

We shall also investigate how the peculiar reasoning in 
Ref. fl^ comes about, and find that it stems from an improper 
temporal interpretation of the logical symbols '|' and 'A'. This 
point has not been noticed in the literature either.^ 

It is also clear that such kinds of seeming "violations", aris- 
ing through mathematical and logical misapplication, can then 
easily be made to appear not only in quantum experiments, but 
in classical ones as well, and this will also be shown by simple 
examples. 

For these purposes, some notation and definitions for prob- 
abilities and the Shannon entropy will first be introduced, 
where the role of the context — e.g., the given experimen- 
tal arrangement — is emphasized. Then, after a discussion of 
the two questioned properties of the entropy, the experiments 
of Ref. 1 10] will be presented and re-analysed. Finally, two 
"counter-experiments" will be presented. 



^ In fact, even Timpson ll2L p. 15] seems to attach an improper temporal 
sense to the logical concept of 'joint probability' . 



n. DEFINITIONS AND NOTATION 

We shall work with probability theory as extended logic Q, 
, so we denote by P{a \ I) the probability of the proposi- 
tion a given the prior knowledge I. This prior knowledge, also 
called (prior) data or prior information, represents the con- 
text the proposition refers to. It is useful to write explicitly 
the prior knowledge — a usage advocated by Keynes ilSjl. 
Jeffreys [17, 19], Koopman ]20, 21, 22], Cox 17, 23II2I, 
Good fol . and Jaynes jl^ — because the meaning and the 
probability of a proposition always depend on a given context. 
Compare, e.g., P('Tomorrow it will snow" ] "We are in Stock- 
holm and it is winter") and P('Tomorrow it will snow" ] "We 
are in Rome and it is summer").^ Even more, the probabil- 
ity of a proposition can be undefined in a given context (i.e., 
that proposition is meaningless in that context): consider, e.g., 
P("My daughter's name is Kristina" ] "I have no children"). 
Common usage tends to omit the context and writes P(a) in- 
stead of P{a \ I), but this usage can in some cases — especially 
when the context is variable, i.e., several different contexts are 
considered in a discussion — lead to ambiguities, as will be 
shown. 

In particular, a proposition can represent an outcome of an 
experiment or observation that was, is, or will be, performed. 
The context in this case consists of all the details of the exper- 
imental arrangement and of the theoretical model of the latter 
which are necessary to assign a probability to that outcome. 
This is true for both "classical" and "quantum" experiments. 

The importance of the context is also evident when consid- 
ering a set of propositions 

A = {fl,-: i - 1, . . . , Hfl} (5) 

such that, in the context /, they are mutually exclusive and ex- 
haustive (i.e., one and only one of them is true). Indeed, it is 
clear that the propositions may be mutually exclusive and ex- 
haustive in some context while not being such in another. For 
example, the propositions {"A red ball is drawn at the first 
draw", "A wooden ball is drawn at the first draw") are mutu- 
ally exclusive and exhaustive in the context "Balls are drawn 
from an urn containing one red plastic ball and one blue 



^ The relevance of prior knowledge is nicely expressed in the following 
brief dialogue, which I could not resist quoting, between Holmes and Wat- 
son l2dl . taking place after the two listened to a client's statement: 

"1 think that 1 shall have a whisky and soda and a cigar 
after all this cross-questioning. I had formed my conclusions 
as to the case before our client came into the room." 

"My dear Holmes!" 

"[. . . ] My whole examination served to turn my conjecture 
into a certainty. Circumstantial evidence is occasionally very 
convincing, as when you find a trout in the milk, to quote 
Thoreau's example." 

"But I have heard all that you have heard." 

"Without, however, the knowledge of pre-existing cases 
which serves me so well." 
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wooden ball", but are neither mutually exclusive nor exhaus- 
tive in the context "Balls are drawn from an urn containing 
one red wooden ball, one red plastic ball, and one blue 
plastic ball", and make no sense in the context "Cards are 
drawn from a deck". 

In the following, propositions which are mutually exclusive 
and exhaustive in a given context will be called alternatives, 
and their set a set of alternatives ^ Such a set may represent 
the possible, mutually exclusive outcomes in a given experi- 
ment. 

The probability distribution for a set of alternatives {a, : / - 
1, . . . , «a} in the context I will be denoted by 



P(A|/) = {P(fl,|0: /=!,...,««) 
and has, of course, the usual properties 

P(fl,- A a,v I = fori^t;'. 



(6) 

(7) 
(8) 



where the conjunction i28l Eol lioll (sometimes also called 
'logical product') of a and b is denoted hy a Ab. 

The Shannon entropy f?] is a function of the probability dis- 
tribution for a given set of alternatives A in a given context I. 
It is defined as 

H[PiA I 0] = -K ^ P(ai I In | /), > 0, (9) 

with the usual conventions for the units in which it is ex- 
pressed and the positive constant K, and the limiting proce- 
dure when vanishing probabilities are present.^ The Shannon 
entropy can be considered to quantify the amount of "uncer- 
tainty" associated with a probability distribution.'' 

Given two sets of alternatives A and B in the context I, the 
set of all conjunctions of the propositions {a, } with the propo- 
sitions {bj]. 



A A B = {fl; A /jj! i — 1, ...,«„; y = 1, . . . , nfc). 



(10) 



is also a set of alternatives in /, as follows from the rules of 
probability theory; it will be called a composite set? Thus the 
composite probabiUty distribution 



P{AAB\D^{P(aihbj\iy. i^\,...,na;j^\,...,nb} 



(11) 



Another name for such a set could be 'event' (cf. Tribus |i27l|); Cox uses 
the terms '(inductive) system' |7] and (with a sUghtly more general mean- 
ing) 'question' (24I1 . 

5 Cox |7,2| gives a generalisation of Shannon's formula which is valid for 
a general set of not necessarily mutually exclusive propositions. 

* A very small sample of the literature on Shannon's entropy is represented 
by the work of Kullback |31], Renyi Aczel et al. 00], Wehrl 01, 
Shore et al. L33-1 . Jaynes 1 16|; see also Forte l34l . 

^ I prefer the term 'composite' to the term 'joint', as the latter seems to bring 
about improper temporal meanings. 



satisfies the properties 

P((fl; A bj) A (ai A bf) 10 = for / i' or ; 9^ /, (12) 
Y^P{aiAbj\l)^\. (13) 



The probability distributions for the sets of alternatives A, 
B, and A A B in the context I are relate d by the following stan- 
dard probability rules (see e.g. Jaynes llm Ch. 2], Jeffreys 
Ch. I]|ll9l Ch. 2], Cox 0, Ch. 1]) 



P(fl,IO = Xi^^'''^^>l'^' 

j 

P(fc,|/) = 2P(fl,A/7,.|0 

(marginal probabilities), and 

P(fl/ A bj I = P(fl/ \bjM) P(bj I 
= P{bj I fl; A P(ai I /) 

(product rule), from which 

P(ai A bill) 

I ^ = I ' if P(bj 10^0, 

Pibj I 

P(ai A bill) 
P(bj I fli A = , ' if Piai 10^0 

P(ai I 



(14a) 



(14b) 



(14c) 



(Bayes' rule) follow. 

All the probability distributions for the sets of alternatives 
A, B, and A A B have associated Shannon entropies. It is also 
possible to define the conditional entropy of the distribution 
for B relative to the distribution for A, as follows: 

H[P(B I A A 0] = I '^"^^^^ I ^ 

= 2 P(ai I /) 2 P(bj I Oi A In Pibj I Oi A /). (15) 

' 

An analogous definition is given for H[P(A \ B A l)]. 



III. PROPERTIES OF THE SHANNON ENTROPY AND 
QUANTUM EXPERIMENTS 

The various Shannon entropies presented in theprevious 
section possess several mathematical properties fll|4|,|5l|3,l2l 
0, 9] which have a fairly intuitive meaning when the Shan- 
non entropy is interpreted as a measure of "uncertainty". We 
shall focus our attention on two of these properties which, it 
is claimed 1 10], are violated in quantum experiments. 

Consider the probability distribution for a composite set of 
alternatives A A B in a context /. The entropy H[P(B \ I)] and 
the conditional entropy H[P(B | A A /)] satisfy 



//[P(B|/)] >//[P(B|AaO]. 



(16a) 
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This property (also called concavity since it arises from the 
concavity of the function -xlnx), intuitively states that the 
uncertainty of the probability distribution for B in the context 
I decreases or remains the same, on average, when the context 
is "updated" because one of the {a,) is known to be true. An 
intuitive picture can be given as follows. Imagine that some- 
one performs an experiment (represented by I) consisting in 
two observations (represented by the sets A and B), and writes 
the results of the experiment in an observation record, say, on 
a piece of paper denoted by T, under the headings 'A' and 
'B\ Now suppose that we know all the details of the experi- 
ment except for the outcomes: we have not yet taken a look at 
the record and are uncertain about what result is written under 
'B\ If we now read the result written under 'A', our uncer- 
tainty about the result under 'B' will decrease or remain the 
same on average (i.e., in most, though not all, cases), since the 
former result can give us some clues about the latter. 
The second property (strong additivity 01) reads 

H[P{A A B I 0] = H{PiA I 0] + H{PiB | A A /)] 

= H[P{B\r)\+H{P{A\BM)], (16b) 

and its intuitive meaning is that the uncertainty of the proba- 
bility distribution for the composite set of alternatives A A B 
is given by the sum of the uncertainty of the probability distri- 
bution for A and the average uncertainty of the updated prob- 
ability distribution for B if one of the {a, } is known to be true, 
or vice versa. Using the intuitive picture already proposed, 
we are initially uncertain about both outcomes written on the 
record This uncertainty will first decrease as we read the 
outcome for 'A', and then disappear completely when we read 
the outcome for 'B' (given that we do not forget what we have 
read under 'A'). Equivalently, the total uncertainty will first 
decrease and then disappear as we read first the outcome un- 
der 'B' and then the one under 'A'. 

Three remarks are appropriate here. The first is that the 
above are mathematical, not physical, properties: they are not 
experimentally observed regularities, but rather follow math- 
ematically, through the properties of basic arithmetical func- 
tions like the logarithm, once we put some numbers (the prob- 
abilities P{a\ I /), P{b2 I 0^ Pibi I «i A I), etc.) into the formu- 
lae (|9jl and M5\ . In particular, they can be neither proven nor 
disproved by experiment, but only correctly or incorrectly ap- 
plied. It is for this reason that one suspects that the seeming 
violations found in Ref. 1 10] are only caused by an incorrect 
application of the formulae M6V . we shall see that this is the 
case, because in the mentioned paper some probabilities are 
used in the right-hand sides of Eqs. jl6> . and (numerically) 
different probabilities in the left-hand sides. 

This leads us to the second remark: the above properties 
hold when all the probabilities in question refer to the same 
context; otherwise, they are not guaranteed to hold. This is 
because if a specific set of alternatives, in two different con- 
texts, has two different sets of probabilities, then it will yield, 
in general, two different entropies as well. Hence we can- 
not expect the formulae M6\ to hold if we use, in the left- 
hand sides, the probabilities relative to a given experiment. 



and, in the right-hand sides, those relative to a different exper- 
iment. This remark amounts simply to saying that the expres- 
sion 'x+y - y+x' holds only if we give x and y the same value 
on both sides (cf. Alice's example in the Introduction). The 
(somewhat pedantic) notation H[P{A \ I)] and H[P{B \ A A 0], 
instead of the more common H{A) and H{B \ A), is used here 
to stress this point. 

The third remark (cf. also Koopman ji^l) is that an expres- 
sion like 'B I A' does not imply that "the observation repre- 
sented by A is performed before the one represented by B"; 
nor does 'A A B' mean that the two observations are carried 
out "simultaneously". The temporal order of the observa- 
tions is formally contained in the context /, and the symbols 
'I' and 'A' have a logical, not temporal, meaning. One must 
be careful not to confuse logical concepts with mathematical 
objects or physical procedures, even when there may be some 
kind of relationship amongst them; this kind of confusion is 
strictly related to what Jaynes called "the mind projection fal- 
lacy" dlHEl- 

Consider, e.g., the following situation: an urn contains a 
red, a green, and a yellow ball, and we draw two balls with- 
out replacement (this is our context /„). Consider the propo- 
sitions a " "Red at the first draw", and b = "Green at the 
second draw". Now, suppose that we also know that the sec- 
ond draw yields 'green'; what is the probability that the first 
draw yields 'red' ? The answer, to be found in any probability- 
theory textbook, is P{a \ b Ah) - P(a A b \ lu)/P(b \ lu) - 
(1/3) X (l/2)/(l/3) = 1/2. We see that the expression 'a\b' 
does not mean that b precedes a (nor does it mean that b 
"causes" a): it just expresses a logical connexion. We can 
also ask: what is the probability of obtaining ^rsf 'red' and 
then 'green'? The answer is again standard: P(aAb\lu) - 1/6. 
We see that the expression 'a A b' does not mean that a and b 
happen simultaneously: it just expresses a logical connexion.^ 

Note that Shannon himself, in his paper, does not attribute 
any temporal meaning to the conditional or joint entropies, 
but only the proper logical one. He uses, e.g. (11 §§11 and 
12], the expressions H[P(X\YAl)] and H[P(X AY \l)] (which 
in his notation are '//y(x)' and 'H(x,yy \1, §6]) where '}" 
represents the received signal, while 'X' represents the signal 
source — i.e., where X actually precedes F.' We may also re- 
mark that Shannon's entropy formulae for channel character- 
istics are not directly dependent on the actual physical details 
of the system(s) constituting the channel, which can be classi- 



We can change the example by considering, instead, the order in which 
three identically prepared radioactive nuclei, in three different laboratories, 
will decay. The analysis and the conclusion will nevertheless be the same. 
'Quantumness' plays no role here. 
' From Shannon's paper: "First there is the entropy H(x) of the source or of 
the input to the channel [...]. The entropy of the output of the channel, 
i.e., the received signal, will be denoted by H{y). [. . . ] The joint entropy of 
input and output will be H(x, y). Finally there are two conditional entropies 
Hy(y) and Hy{x), the entropy of the output when the input is known and 
conversely. Among these quantities we have the relations H(x, y) = H(x) + 

//,,0') = ffO') + ff>W'kit§ii]. 
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cal, quantum, or even made of pegasi and unifauns. Only their 
statistical properties are directly relevant. 

Hence, the conjunction a,- A bj = bj A (and B A A = A/\B) 
is commutative even if the matrix product of two operators 
which can in some way be associated with these propositions 
is not. 

We shall see that these remarks have indeed a bearing on 
the analysis presented in Ref. | lOJ, where the validity of the 
formulae M6\ was analysed in two quantum experiments and 
a classical one, and it was found that, seemingly, Eqs. did 
not hold in the quantum case, while they still held in the clas- 
sical one. These experiments will be now presented using first 
their original notation of Ref. 1 10] (indicated by the presence 
of quotation marks), which makes no reference to the context, 
and then re-analysed using the expanded notation introduced 
above. 



A. First quantum experiment 

The first quantum experiment |10, Fig. 4] runs as fol- 
lows. Suppose we send a vertically polarised photon through 
a horizontal polarisation filter. The set of alternatives B = 
{^out, ^not-out} refers to the photon's coming out of the filter, 
with boux =■ "The photon comes out of the horizontal filter", 
^not-out = "The photon does not come out of the horizontal 
filter". Since we are sure about ^not-out, i-e., ''P{baoi-oud - 1"> 
we have that 



H{B) = ' 



Then we insert a diagonal (45°) filter before the horizontal 
one; the set of alternatives A refers to the photon's coming out 
of the diagonal filter, with flout and flnot-out defined analogously. 
Now, if we know that the photon has come out of the diagonal 
filter, we are no longer sure that it will not get through the 
horizontal filter, and so the uncertainty on B is increased by 
knowledge of A: 



"i/(B|A)>0". (18) 



Thus we find 



" = H(B) < H{B I A) " (seemingly) (19) 

and property ( I16a> is seemingly violated. 

Only seemingly, though. The fact that a diagonal filter was 
considered in the reasoning leading to Eq. ( I18t but not in the 
reasoning leading to Eq. Ml\ . makes us doubt whether the 
comparison of the entropies "H(B)" and "H{B | A)" really cor- 
responds to Eq. ( I16a> : these entropies, in fact, apparently refer 
to two different experimental arrangements — i.e., to two dif- 
ferent contexts. It becomes evident that this is indeed the case 
if we proceed by calculating numerically and explicitly all the 
probabilities first, and then the entropies, keeping the context 
in view. 

Let us consider again the first experimental set-up, which 
will be denoted by /one, where only one, horizontal, filter is 



present. For the set B, we have of course that, using our nota- 
tion. 



Pib^^X I Ion,) = 0, 



P(bn 



/o„e)= 1, 



SO that the Shannon entropy is 

H{P(B\l,,,)]^H{OA)^Oh\i. 



(20) 



(21) 



At this point, asking for the probabilities for the set of alter- 
natives A, we realise that the propositions flout = "The photon 
comes out of the diagonal filter" and flnot-out = "The photon 
does not come out of the diagonal filter" make no sense here, 
since no diagonal filter is present; consequently, there exist no 
entropies like H[P{A \ /one)] or H[P{B \ A A /one)]- So in this 
experimental set-up it is not even meaningful to consider the 
property ( I16a> . 

When we insert a diagonal filter before the horizontal one, 
we have a new, different experimental arrangement, which 
will be denoted by /two- In this new set-up it does make sense 
to speak of both sets A and B.'" Quantum mechanics yields 
the following probabilities: 



^(^not-out I ^two) 
2' 



^ 'two) — ^1 



P(.O.o\H I 'two) — 2 

P{boux I flout 

^'(''not-out I 'J'out ^ 'two) — 5; 
P(bout I flnot-out ^ 'two) — 0, 
^(^not-out I flnot-out ^ 'two) — 1 ■ 



(17) whence, by the product rule, it follows that 



Pibout I 'two) 



I 'two) 



(22) 
(23) 

(24) 
(25) 



At this point we notice that the probabilities for the set B, 
Eqs. ( I25> , differ numerically from those calculated in the pre- 
vious set-up, Eqs. j20> . This difference forces us to take note 
of the difference of the set-ups /one and /two; if we ignored this, 
inconsistencies would arise already for the probabilities, even 
before computing any entropy. 

The Shannon entropy for the probability distribution for the 
set B appropriate in this context is readily calculated: 



H[P(B|'two)] = //(i|)^0.81bit, 



(26) 



In the original foimulation of the example fl(f\, the authors denote "by A 
and B the properties of the photon to have polarization at -1-45° and hori- 
zontal polarization, respectively"; so that A should perhaps be defined as 
{"The photon has diagonal (45°) polarisation", "The photon has no diag- 
onal (45°) polarisation"), and B analogously. However, there are problems 
with these propositions. If the photon is absorbed by the diagonal filter, 
then it makes no sense to say that the photon has no diagonal polarisation, 
since the photon is no longer present (note that this problem has nothing to 
do with the non-existence of properties of a system before an observation: 
the point is that, if no photon is present, then it does not make sense to 
speak about its properties anyway). For this reason the alternative proposi- 
tions flout, ^not-oiii. etc., have been used here; however, this has not affected 
the point of the experiment in Ref. Ildl — namely, the seeming violation 
of property I16al . 
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and we see that, in fact, it differs from the one in the first 
set-up: thus, in the first quantum example of Ref. \ 10] the ex- 
pression "H(B) " is inconsistently and ambiguously used. 
The conditional entropy relative to A is: 

H[P(B I A A /two)] = 0.5 bit, (27) 

and, as a consequence, 

0.81 bit « H[P(B I 4„o)] > H[PiB\AA /two)] = 0.5 bit, (28) 

in accord with the property (I16a> . 

So no violations of the property ( I16a> are found here. Equa- 
tion il9i . from Ref. fioll . is simply incorrect, and the seeming 
"violations" that arose from it disappear at once if we write 
that equation more correctly as 



H[P{B\k„,)]<H[P(B\AAl,^,)], 



(29) 



where we see that the left-hand side refers to the set-up Zone, 
whereas the right-hand side refers to the different set-up /two, 
so that the comparison is between entropies relative to dif- 
ferent experiments, and the equation does not concern prop- 
erty (116a) . This fact escaped the attention of the authors of 
Ref. 1 10], partly because the contexts were not explicitly writ- 
ten, and partly because the probabilities were not explicitly 
calculated (so that the authors did not notice that the probabil- 
ities for B had two numerically different sets of values). The 
authors also seem to interpret the expression "H{By' as im- 
plying that no observation must precede B, and the expression 
"H{B I A)" as implying that the observation relative to B must 
be preceded by the one relative to A. As already remarked, 
this needs not be the case. 

It should be noted that the experimental arrangements Zone 
and /two are incompatible, also in the formal sense that their 
conjunction /one A /two is false. We could erroneously see /two 
as a "more detailed" description of /one, equivalent, for exam- 
ple, to the conjunction of /one and the proposition "Moreover, 
a diagonal polarisation filter is present between the photon 
source and the horizontal filter". But this is not the case: /one 
states that nothing is present between the source and the hori- 
zontal filter; if it had been otherwise, and /one had left open the 
possibility that something unknown could be between source 
and filter (a linear or circular polarisation filter, or a mirror, 
or an opaque screen, or something else), then we should have 
assigned a different state to the photon reaching the horizontal 
filter, and the calculation of the probabilities would have been 
very different ITtIi . 



B. Second quantum experiment 

The second experiment fid, Fig. 5] is as follows. We send a 
spin- 1/2 particle with spin up along the z axis through a Stern- 
Gerlach apparatus aligned along the axis a that lies in the xz 
plane and forms an angle a with the z axis. Let us denote by 
A the set {a^p, fldown] with a^p = "The particle comes out with 



spin up along a" and Odown = "The particle comes out with 
spin down along a". We have the following probabilities, in 
the notation of Ref. ifioll : 

"P(flup) = cos2f, P(fldown) = sin^ f ", (30) 

and a corresponding Shannon entropy which amounts to 

" H(A) = //(cos^ f , sin^ f ) ". (31) 

The particle then proceeds to a second Stern-Gerlach appara- 
tus aligned along the x axis; let us denote the corresponding 
set of alternatives by B = {b^p, ^downl, where bap and fedown are 
defined analogously to a^p and Odown above. The conditional 
probabilities for B relative to the outcomes for A are: 



" P(bap I fl,p) = cos2(f - f) ", 
" P(bio«n I flup) = sin^d - f ) ", 

" m,p|fldown) = sin2(2-|)", 
" ^'(^down I fldown) = COS^(f - f ) ", 



(32) 



(33) 



and we can easily calculate the following Shannon conditional 
entropy: 

" H(B I A) = cos^ f X //(cos^ f , sin^ f ) 

+ sin^ f X //(sin^ f , cos^ f ) (34) 
= //(cos2f,sin2f)". 

The sum of the entropies thus far calculated is 

" H(A) + H(B I A) = 2 //(cos^ f , sin^ f ) ". (35) 

Now we suppose to exchange the two Stern-Gerlach appa- 
ratus, putting the one along x before the one along a. We then 
find the following probabilities for B: 



"P{bap)^\, P(/7down)=i 

with the associated Shannon entropy 

" //(B) = //(i,i)". 



(36) 



(37) 



The conditional probabilities, given by quantum theory, for 
the set A relative to the outcomes for B are: 



" P{aap I feup) = sin2(f - f ) ", 
" ^(fldownlM = cos2(f -f)", 

" Pia^p I Z^down) = COs2(2 - f ) ", 
" i'(fldown I ^down) = ^^Aj " f ) "> 



(38) 



(39) 



and together with the probabilities ( I36> they lead to the condi- 
tional entropy 

" Hi A I B) = sin^ f X H(sin^ f ,cos^ f ) 

+ cos^ f X H(cos2 f , sin^ f ) (40) 
= //(cos2f,sin2 2)". 
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The sum of the entropies (I37> and (I40> now yields 

" H{B) + H(A I B) = //(i, i) + H{cos^ f , sin^ f ) ", (41) 

but this is in general (e.g. for a = Ji/4) different from the 
sum ( I35> , and thus we find that, in general, 

" H{A) + H(B I A) + H{B) + H(A \ B) " (seemingly), (42) 

in seeming contradiction with the property (I16b> . 

However, we notice that two different relative positions of 
the Stern-Gerlach apparatus were considered, in order to ar- 
rive at Eqs. ( I35> and (I41> respectively. This makes the seem- 
ing contradiction above just an artifact produced, again, by 
the comparison of Shannon entropies relative to two different 
experimental arrangements, analogously to what happened in 
the experiment with photons previously discussed. Also in 
this case, this is shown by an explicit calculation of all the 
probabilities relative to the experiments. 

In the first set-up, which can be denoted by m, we send 
the spin- 1/2 particle to the Stern-Gerlach apparatus oriented 
along a (associated to the set of alternatives A), which is in 
turn placed before the one oriented along x (associated to the 
set B). Basic quantum-mechanical rules yield the following 
probabilities (in our notation): 



P(aup I m) = COS^ |, P(fldownl'«) 

PQy^P I flup A m) = cos^Cf - f ), 
^(^down I flup A m) = sin^(f - f ), 

^'(^up I fldown A m) = sin2(f - f ), 

P{bdova I fldown A m) = COS^(f - f ), 



sm 



2 a 



(43a) 
(43b) 

(43c) 



and these are sufficient to calculate, by the product rule ( I14bl l. 
the joint probabilities 



Pab = ^^(flup A b^p I m) = cos 2 — ' 

Pah = P(aup A fcdown I m) = COS 
Pah = 



'ah = P(a, 
P(ad< 

lown C 



■ cos2(f - 

2 . Qin^Ci 



cos^ f ■ sin^Cf - f ), 
iown A /7up I m) = sin^ f ■ sin^(f - f ; 



-) 

iown I »^) = ""^ 



-■ sin^ f ■ cos^d - f ). 



(44a) 
(44b) 
(44c) 
(44d) 



The formulae above show clearly that, as remarked in ^Il| it 
does make sense to consider the composite probability of the 
outcomes of the two temporally separated observations, as a 
composite probability needs have nothing to do with the fact 
that the observations are performed "simultaneously" or not. 

From the joint probabilities, we can compute all probabili- 
ties involved in this set-up. Applying the marginal probability 
rule (I14a> and Bayes' rule ( I14ct we find: 



P(flup I m) = Pah + Pah' P(ado^n I m) = Pab + Pah 
P(bup I m) - Pah + Pah, P(b down I '«) = Pab + Pah 

Pab 



Piaup I ^up A m) 



P(adown I ^up A m) -- 
P(aup I ^down A m) = 

^'(fldown I ^down A m) 

P(bup I aup A m) 



Pab + Pab 
_ Pab 

Pab + Pab 
Pab 

Pab + Pab 
^ Pab 

Pah + Pab 

Pab 



Pibdov^n I flup A m) = 
P(bup I fldown A m) = 



Pab + Pab 
Pab 



Pibdov^n I fldown A m) = 



Pab + Pab 

Pah 

Pah + Pab ' 
Pab 



(45) 

(46) 
(47) 

(48) 

(49) 

(50) 



Pah + Pab 



Here the probabilities (I43l l have been re-written in terms of 
the joint probabilities. 

A look at Eqs. ( I47> and J48> shows that Bayes' rule also 
applies in "quantum experiments", and provides a counter- 
example to the incorrect statement that the expression 'a^p \ 
bup' would mean "b^p precedes Oup "■ 

We can proceed to calculate the Shannon entropies 



H[P(A I m)] = -K[{pab + Pah) MPab + Pah) + (Pah + Pab) MPab + Pab)]' (51) 

H[P(B I m)] = -K[(pab + Pab) MPab + Psfc) + (/?a5 + Pab) MPah + Pab)]' (52) 
as well as the conditional entropies 

H[P{B I A A m)] = 7^ (/,,, + p,j) ( In —^2^^ In — ^^i^) 

^ Pab + Pah Pab + Pab Pah + Pab Pab + Pab' 
, T^l , Pab , Pab Pah , Pah \ 

+ K(pab + Pa-h)\ In In 

^ Pab + Pab Pah + Pah Pah + Pab Pah + Pab' 

= K [-pah In Pah - Pah In pab - Pah ^npab - Pah ^npah 

+ (Pab + Pab)MPab + Pah) + (Psb + Pab) MPah + Pab)]' (53) 
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H[P{A \BAm)]=K {pab + Pab) ( — 

V Pab + 



Pab . Pab Pab , Pah 

In In 



Pab + Pab Pab + Pab Pab + Pab Pab + Pab ' 
,1^/ , Pah , Pab Pab , Pab \ 

+ K(Pah + Pab){ ; In ; \ In \ 

^ Pah + Pah Pah + Pah Pah + Pab Pab + Pab' 
= K [-pab In Pab - Pab Psb ' Pab ^^Pab ' Pah ^^Pab 

+ (Pab + Pab) MPah + Pah) + (Pah + Pab) MPab + Pab)] - (54) 

where the expressions have been simpHfied making use of the additivity property of the logarithm. 
Finally, from Eqs. i5l\ and i53\ . i52\ and i54\ . we find 

H[P{A I to)] + H[P{B I A A to)] = H[P(B \ to)] + H[P(A \ B A to)] 

^ -K (pab In Pah + Pah In Pah + Pab In Pah + Pah In Pah) (55) 
= H[P{A A B \ m)]. 



whence we see that the property jl6b> is satisfied (we find, 
e.g., that H[P{A \ to)] + H[P{B \ A A to)] = H[P{B \ to)] + 
H[P{A I B A to)] = H[P{A A B I to)] ^ 1 .20 bit, for a = njA). 

We note that the way in which Eq. \55l has been found 
does not depend on the numerical values of the probabilities 
{pij], but only on the additivity property of the logarithm; so 
the calculations above can be seen as a mathematical proof of 
the property \\6h\ for the special case of sets with only two 
alternatives. 

If we invert the positions of the two Stern-Gerlach appara- 
tus, placing the one oriented along x (associated to the set B) 
before the one oriented along a (associated to the set A), we 
then realise a new, difi^erent experimental arrangement, which 
can be denoted by TOinv. The probabilities for the sets of al- 
ternatives A and B will thus difi^er from those in to: we have 
indeed 

^'(^up I '«inv) = 5, P(bdov,n I »Jinv) = 5, (56a) 
^'(flup I ^up A TOinv) = Sin^(f - f ), 

(56b) 

^'(adown I ^up A TOinv) = COS (f - f ), 
^'(flup I ^down A TOinv) = COS^(| - f), 

(56c) 

^'(adown I ^down A TOinv) = Sm (| - f ). 

It is clear that, from this point on, we can proceed as in the 
analysis of the first set-up, obtaining 

//[P(A|TO,„v)]+i/[P(B|A ATOinv)] 

= H[P(B I TOinv)] + H[P(A I B A TOinv)] 

= -K (P'ab In Pab + P'ab In P'ab ^^^^ 

+ P'ab^''P'ab+P'ab^''P'ab^^ 
= H[P{A A B I TOinv)] 

where the [p'-j] are the values of the joint probabilities {f (a, A 
bj I TOinv)}, different, in general, from the {pij}- In any case, 
property ( I16bt is again satisfied in the new context (we have, 

e.g., H[P(A I TO,„v)] + H[P(B I A A TO.nv)] = H[P(B \ TOinv)] + 

//[P(A|BATOinv)] = i/[P(AAB|TOinv)] ~ 1 .60 bit, for a = ji/4). 



Thus we have found no inconsistencies in this second ex- 
periment either: Equation i42\ . from Ref. [10], is simply in- 
correct, and can more correctly be written as 

H[PiA I to)] -I- H[P(B I A A to)] 

^ H[PiB I TOinv)] + H[P(A I B A TO,„v)], (58) 

or equivalently and more briefly as 

H[P(A A B I to)] 5t H[P(A A B I TOinv)], (59) 

and its content is that the Shannon entropies in the experiment 
TO are in general different from those in the different experi- 
ment TOinv. This does not surprise us, since the experiments 
have diff'erent set-ups. 

The fact that Eq. J42> does not pertain to property ( I16bl l 
is not noticed in Ref. |10], again because the contexts are 
not kept explicit in the notation. Moreover, the expression 
"//(B I A)" is considered there as implying that the observation 
corresponding to A is performed before the one corresponding 
to B:" it is for this reason that, in order to calculate "//(Z?|A)", 
the authors consider the experiment in which the observation 
corresponding to A is performed before the one correspond- 
ing to B, but then, in order to calculate "H(A \ B)", they feel 
compelled to change the order of the observations — with the 
only effect of changing the whole problem and all probabili- 
ties instead! But, as already remarked, the conditional symbol 
'I' does not have that meaning. The point is that the temporal 
order of acquisition of knowledge about two physical events 
does not necessarily correspond to the temporal order in which 
these events occur. 

C. Classical experiment 

Together with the two quantum experiments, the authors of 
Ref. 110,1 also present an example of a classical experiment 



' ' Cf. the discussion of the formula "//(A) + H{B \A) = H(B) + H(A | B)" in 
Ref §111]. 
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in which, they claim, the property ( I16b> is not violated. It 
is useful to re-analyse this example as well, in order to show 
that, in fact, it is not an instance of confirmation of the prop- 
erty ( ll6bL because it, too, involves two different experimental 
arrangements. 

The idea Fig. 3] is as follows. We fill a box with four 
balls of different colours (black and white) and compositions 
(plastic and wood). There are two black plastic balls, one 
white plastic ball, one white wooden ball. We shake the box, 
draw a ball blindfold, and consider the set A = {flbiack^ «whitel 
for the ball's being black or white. If the ball is black, then 
we put all black balls in a new box, draw a new ball from this 
box, and consider the set B = {^plastic, ^wood) for this ball's 
being plastic or wooden. We proceed analogously if the first 
drawn ball was white instead.'^ Let us denote the set-up just 
described by n. 

The probabilities of first drawing a black or a white ball are 
respectively P(flbiack I «) = 1/2 and /"(dwhite \n) - 1/2 and thus 
their Shannon entropy is 

H[P(A I «)] = i) = 1 bit. (60) 

The conditional probabilities of the second drawn ball's being 
plastic or wooden, given the outcome of the first observation, 
are 

P(bpVastic I aWack A n) = 1, P(b„ood I flblack A «) = 0, (61) 

if the first result was 'black', and 

^'(^plastic I fwhite A n) = i, P(b^ood I Awhite A «) = i, (62) 

if it was 'white'. From these probabilities the following Shan- 
non conditional entropy can be computed: 

i/[P(B|AA«)] = ii/(l,0)+i//(i,i) 

= 5 X bit -I- 5 X 1 bit = 0.5 bit. 

Combining Eqs. (I60> and (I63> we obtain 

H[P(A I n)] + H[P(B I A A «)] = 1 .5 bit. (64) 

Now we suppose to make the observations in inverse order 
instead. We shake the initial box, draw a ball, consider first the 
set B = {^piasdcj ^woodl for the ball's being plastic or wooden. 
Depending on the outcome we fill a new box either with the 
plastic or the wooden balls, and draw a new ball; then we 
consider the set A = {flbiack, flwhite) for the new ball. Let us 



In Brukner and Zeilinger's original example, the black and white balls are 
put into two separate boxes after the first draw, and a ball is drawn /rom 
each box separately. But these two final draws ai'e then related to two sepa- 
rate observations, not one, so that in total we have three observations in this 
experiment, and the formula I16bi is not even appropriate. The experiment 
has thus been modified here, in order to preserve the two authors' original 
intention. 



denote this new experiment by ninv. It is clear that n and ninv 
are really different experiments, for the following reason: in 
n, between the two draws, the second box contains black or 
white balls; while in «inv it contains plastic or wooden balls. 

The probabilities for the set B this time are P(fepia.stic I "inv) = 
I and P(fewood I Winv) = |, with an entropy 

//[P(B I «„„)] 1)^0.81 bit, (65) 

while the conditional probabilities for A are 

^'(ablack I Aplastic A Hinv) = |, 

1 (66) 

P{<^whke I Aplastic A Hinv) — 3, 

if the first result was 'plastic', and 

^'(flblack I ^wood A Hinv) = 0, ^^^^ 
^'(Qwhite I ^wood A Hinv) — 1 , 

if it was 'wooden'. The conditional entropy is 

H[PiA I B A «i„v)] = 1 Hil i) + i mo, 1) 

«lx 0.92 bit H- i X bit (68) 
« 0.69 bit, 

and adding this time Eqs. (I65> and ( I68> . we find 

H[P(B I «i„v)] + H[P(A I B A «i„v)] = 1 .5 bit. (69) 
We see that the entropies (I64t and ( I69l l are equal, 

H[P(A\m„,)] + H[P(B\AAni,,)] 

= H[P{B I «;„,)] + H[P{A I B A «,„v)] , (70) 

which is equivalent to 

H[P(AAB\n)]^H[P(AAB\niny)], (71) 

but it is clear that this is not the statement of property (I16bl l. 
because the right- and left-hand sides of this equation refer to 
two different experiments. The content of the equality above 
is only that the Shannon entropies for the probability distribu- 
tions for the composite set of alternatives A h B are equal in 
the two experiments n and Winv- 

IV. THE ROLE OF "QUANTUMNESS" 

We have shown thus far that the quantum experiments in 
Ref. 1 10| did not involve any violation of the Shannon en- 
tropy's properties. However, we may still imagine someone 
raising the following argument: 

"Very well, the properties ( I16> are not vio- 
lated in any experiment. But one notices that the 
equality /or different contexts 

H[P(AAB\n)]^H[P(AAB\nind], (EDr 
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while holding in the classical experiment of 
^111 CI does not hold in general in the quantum 
one of ^111 B I where it is found instead that 

H[P{A A B I m)] + H[P(A A B \ minv)] . (|59}r 

From this particular case, one can see that in clas- 
sical experiments the Shannon entropy remains 
the same if the temporal order of observations 
is changed, whereas in quantum experiments the 
entropy changes together with the change in tem- 
poral order. This phenomenon is thus a pecu- 
liarity of the quantum nature of the experiments 
— a sort of 'quantum-context-dependence' of the 
Shannon entropy." 

But this argument has no validity, of course. The Shan- 
non entropy is always "context dependent", and this comes 
from the fact that probabilities are always context dependent, 
in both classical and quantum experiments. We can further 
and illustrate this fact by means of two more experiments, 
which will also serve as counter-examples of the ones already 
discussed. 



The Shannon entropies are 

H[P(A \k)]^H(\,j)^ 0.92 bit, 

H[P(B \AAk)] = \H(l,0)+j H{\, \) 

= i X bit -H I X 1 bit 0.67 bit, 

and their sum is 

H[P{A I k)] + H[P(B \AAk)] 

= H[P(B I k)] + H[P(A \BAk)] 
= H[P(AAB\k)] ^ 1.58 bit. 



(75) 



(76) 



(77) 



Now let us consider the set-up ki„v, in which the observa- 
tions are made in reverse order, but with the same general pro- 
cedure. The probabilities are then: 



^( Aplastic I ^inv) — 35 ^(^wood I ^inv) — 

^'(ablack I Aplastic A ^inv) = 
^('^white I Aplastic A ^inv) — 

^(«black I ^wood A ^inv) - 
Pia^hhe I ^wood A ^inv) = 1 



2 

3' 
I 

3' 



(78) 

(79) 
(80) 

(81) 
(82) 



A. First counter-example 

The first counter-example is a modification, based on the 
examples presented by Kirkpatrick 1 38, 39] of the experiment 
with the balls discussed in ^111 CI The balls are in addition 
big or small as well now: there are one big black plastic ball, 
one small black plastic ball, one small white plastic ball, and 
one small white wooden ball. 

Initially, we prepare the box so that it contains only all small 
balls. Then we shake the box, draw a ball blindfold, and con- 
sider the set A = {flbiack, flwhite) for the ball's being black or 
white. If the drawn ball is black, then we prepare the box so 
that it contains only all black balls (also the big black one that 
was initially not in the box), draw a new ball from this box, 
and consider the set B = {^plastic, ^wood I for this ball's being 
plastic or wooden. We proceed analogously if the first drawn 
ball was white. '"^ It is easy to see that, in the set-up just de- 
scribed, denoted by k, we have the following probabilities: 

^'(flblack \k) ^ \, ^'(flwhite I fc) = |, (72) 
^'(^plastic I flblack A fe) = 1, Pib^ood I Ablack A ^) = 0, (73) 
P(bplislic I flwhite Ak) - ^, P(b„ood I flwhite A ^) = 5. (74) 



These lead to the entropies 

H[P(B I ^i„v)] = H(l I) ^ 0.92 bit, (83) 

H[P{A I B A kin.)] = I Hil i) + I H(0, 1) 

« I X 0.92 bit H- i X bit (84) 
^ 0.61 bit, 

and the sum 

H[P(B\ki,,)]+H[P(A\BAki,,)] 

= H[P(A I ^i„,)] + H[P(B I A A ki,,)] 
= //[P(AAB|A;inv)] ~ 1.53bit. 

Comparing Eqs. J77> and ( I85> we find 

H[PiA AB\k)]i^ H[P(A A B I ^inv)] . (86) 

Hence, for this experiment, of a clearly classical nature, we 
obtain different statistics and entropies depending on the order 
in which we observe colour and composition.'^ 

Note, in any case, that in each of the two set-ups — 
Eqs. ( I77> and (I85> — the properties il6\ are always satisfied, 
just as in the quantum experiments. 



Cf. also the toy models discussed by Hardy |4(J and Spekkens |41||. 
The Reader should not, too simply, identify the "system" here with some 
specific group of balls. It is rather associated with a variable collection 
of balls, in analogy with a classical open system associated with variable 
number (and species) of particles. 



We may note, incidentally, that it has long been known that the thermody- 
namic entropy of a classical thermodynamic system in a non-equilibrium 
state depends on its complete previous history ||42|| . 
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B. Second counter-example 

The second counter-example is of a quantum-mechanical 
nature. It runs precisely like the experiment with spin- 1/2 par- 
ticles discussed in ^111 Bl except that now the particle has ini- 
tially spin up, not along the z axis, but along the axis b that 
bisects the angle a~x (i.e., b lies in the xaz plane and forms an 
angle /? = nlA-ajl with both the x and a axes; remember that 
a is the angle az). The analysis of this experiment proceeds 
completely along the lines of the re-analysis of ^111 B I if we 
introduce the contexts q and q'inv, and change Eqs. ( I43> with 



^'(flup I q) = cos^ |, P(fldown I q) = sin^ |, 
^'(^up I fliip A ^) = cos^/3, 



P(b, 



down I "up 



flup f^q) - sin p. 



P(bnp I fldown A ^) = sin yS, 

P(bdown I fldown A g') = COS^yS, 



and Eqs. j56> with 

^'(^up I qim) = COS 



2 P 



P(b down I q-mw) = sin 



^'(flup I bup A ^inv) = sin p, 

^'(fldown I ^up A Q'inv) = COS^/?, 
-P(flup I ^down A qiny) = COS^yS, 

^'(adown I ^down A ^i„v) = sin^. 



(87a) 
(87b) 

(87c) 

(88a) 
(88b) 

(88c) 



It is obvious that this leads to the equalities P(fl, A bj \ q) - 
Piflj A bj I q'inv) and eventually to the equality 



H[P{A hB\q)]^ H[P(A A B I ^inv)], 



(89) 



exactly as it happened in the experiment with the balls of 
jnm (for a = ji/4, e.g., we have H[P(A A B | ^)] ^ 0.83 bit). 
But here the experiment is a purely quantum one: we see in- 
deed that the observables do not commute here, the initial state 
is pure, and its density matrix is not diagonal in either of the 
observables' bases. Compare this result with the discussion in 
Ref. S. 



V. CONCLUSIONS 

We have shown that the properties of the Shannon entropy 
are not violated in^quantum experiments, contrary to the con- 
clusions of Ref. 1 10|. In that paper, an idiosyncratic temporal 
interpretation of the conditional symbol '|' and of the conjunc- 
tion symbol 'A' leads the authors to change experimental set- 
ups in order to calculate various entropies. As a result, they 
compare entropies relative to different experimental arrange- 
ments instead, and do not notice that the probabilities are also 
different, and so their results (Eqs. ( I19> and J42> in this paper), 
besides not being formally correctly written, do not pertain to 



the properties of the Shannon entropy (Eqs. M6V ). which refer 
to a single, well-defined experiment and hold unconditionally. 

The peculiar results arising from the comparison of en- 
tropies relative to different experimental contexts can thus ap- 
pear or not appear in any kind of experiments, classical as well 
as quantum; this has been shown by means of two counter- 
examples: a classical one, in which a change in the temporal 
order of the experiment leads to a change in entropy values, 
and a genuinely quantum one, in which the same temporal 
change leads to no entropy changes. 

A conclusion is that Bohr's dictum ought to be observed 
also in mathematical notation, and not only in the analysis of 
quantum phenomena, but in the analysis of classical phenom- 
ena as well, because probabilities — and, consequently. Shan- 
non entropies — always depend on "the whole experimental 
arrangement" iKll taken into account. 
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